Sparse modeling of posterior exemplars for keyword detection
نویسندگان
چکیده
Sparse representation has been shown to be a powerful modeling framework for classification and detection tasks. In this paper, we propose a new keyword detection algorithm based on sparse representation of the posterior exemplars. The posterior exemplars are phone conditional probabilities obtained from a deep neural network. This method relies on the concept that a keyword exemplar lies in a low-dimensional subspace which can be represented as a sparse linear combination of the training exemplars. The training exemplars are used to learn a dictionary for sparse representation of the keywords and background classes. Given this dictionary, the sparse representation of a test exemplar is used to detect the keywords. The experimental results demonstrate the potential of the proposed sparse modeling approach and it compares favorably with the state-of-the-art HMM-based framework on Numbers’95 database.
منابع مشابه
Subspace Detection of DNN Posterior Probabilities via Sparse Representation for Query by Example Spoken Term Detection
We cast the query by example spoken term detection (QbESTD) problem as subspace detection where query and background subspaces are modeled as union of low-dimensional subspaces. The speech exemplars used for subspace modeling are class-conditional posterior probabilities estimated using deep neural network (DNN). The query and background training exemplars are exploited to model the underlying ...
متن کاملSparse Subspace Modeling for Query by Example Spoken Term Detection
We cast the problem of query by example spoken term detection (QbE-STD) as subspace detection where query and background are modeled as a union of low-dimensional subspaces. The speech exemplars used for subspace modeling consist of class-conditional posterior probabilities obtained from deep neural network (DNN). The query and background training exemplars are exploited to model the underlying...
متن کاملSparse modeling of neural network posterior probabilities for exemplar-based speech recognition
We study automatic speech recognition by direct use of acoustic features (exemplars) without any assumption on the underlying stochastic process. The prior studies exploit spectral exemplars. In this work, we present the use of neural network sub-word posterior probabilities as exemplars. The space of sub-word observations is lowdimensional (e.g. RK×T ) whereas the word transcription requires r...
متن کاملPhonetic and Phonological Posterior Search Space Hashing Exploiting Class-Specific Sparsity Structures
This paper shows that exemplar-based speech processing using class-conditional posterior probabilities admits a highly effective search strategy relying on posteriors’ intrinsic sparsity structures. The posterior probabilities are estimated for phonetic and phonological classes using deep neural network (DNN) computational framework. Exploiting the class-specific sparsity leads to a simple quan...
متن کاملیادگیری واژه نامه برای آشکارسازی محل پلاک خودرو
Car license plate detection has been always a challenging task in the context of traffic control and traffic offenses. In this paper, the problem of license plate detection from gray scale images taken in natural conditions is addressed. Our car plate database consists of images with severe imaging conditions such as low quality images, far distanced cameras, and severe weather conditions. The ...
متن کامل